Multi-Syllable Phonotactic Modelling
نویسنده
چکیده
This paper describes a novel approach to constructing phonotactic models. The underlying theoretical approach to phonological description is the multi-syllable approach in which multiple syllable classes are deened that reeect phonotactically idiosyncratic syllable subcategories. A new nite-state formalism, ofs Modelling, is used as a tool for encoding, automatically constructing and generalising phonotac-tic descriptions. Language-independent prototype models are constructed which are instantiated on the basis of data sets of phonological strings, and gener-alised with a clustering algorithm. The resulting approach enables the automatic construction of phono-tactic models that encode arbitrarily close approximations of a language's set of attested phonological forms. The approach is applied to the construction of multi-syllable word-level phonotactic models for German, English and Dutch.
منابع مشابه
A Language Independent Approach To Acquiring Phonotactic Resources for Speech Recognition
Building and developing linguistic resources for languages is of prime importance with many areas of application. This paper focusses on a fully automatic approach to the aquisition of a syllable phonotactics for a particular language. In this approach the phonotactic constraints for a language are encoded in a finite-state phonotactic automaton the structure of which can be automatically deriv...
متن کاملImproving Syllabification Models with Phonotactic Knowledge
We report on a series of experiments with probabilistic context-free grammars predicting English and German syllable structure. The treebank-trained grammars are evaluated on a syllabification task. The grammar used by Müller (2002) serves as point of comparison. As she evaluates the grammar only for German, we reimplement the grammar and experiment with additional phonotactic features. Using b...
متن کاملPhonotactic and prosodic effects on word segmentation in infants.
This research examines the issue of speech segmentation in 9-month-old infants. Two cues known to carry probabilistic information about word boundaries were investigated: Phonotactic regularity and prosodic pattern. The stimuli used in four head turn preference experiments were bisyllabic CVC.CVC nonwords bearing primary stress in either the first or the second syllable (strong/weak vs. weak/st...
متن کاملOn the syllable structures of Chinese relating to speech recognition
It is well known that Chinese is a tone language with multi-tone system, but the distinctive syllable structures relating to speech recognition have not brought to phoneticians' attention yet. The syllable structures, the phonotactic rules were discussed and the joint probability of the initials and the finals were given in this paper. A comparative study of the relative information transmitted...
متن کاملAcquiring Reusable Multilingual Phonotactic Resources
This paper presents a fully automatic procedure for acquiring reusable phonotactic resources from syllable annotated data. The procedure makes use of a regular inference algorithm and the acquired resources are stored in a specialised XML representation. The technique is then extended to support acquisition from phoneme labelled data while providing a semi-automatic annotation system assisting ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره cs.CL/0102020 شماره
صفحات -
تاریخ انتشار 2000